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The ’Basic’ HTTP Authentication Scheme 


Abstract 
This document defines the "Basic" Hypertext Transfer Protocol (HTTP) 
authentication scheme, which transmits credentials as user-id/ 
password pairs, encoded using Base64. 

Status of This Memo 


This is an Internet Standards Track document. 


This document is a product of the Internet Engineering Task Force 


(IETF). It represents the consensus of the IETF community. It has 
received public review and has been approved for publication by the 
Internet Engineering Steering Group (IESG). Further information on 


Internet Standards is available in Section 2 of RFC 5741. 
Information about the current status of this document, any errata, 


and how to provide feedback on it may be obtained at 
http://www.rfc-editor.org/info/rfc7617. 
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1. Introduction 


This document defines the "Basic" Hypertext Transfer Protocol (HTTP) 
authentication scheme, which transmits credentials as user-id/ 
password pairs, encoded using Base64 (HTTP authentication schemes are 
defined in [RFC7235]). 


This scheme is not considered to be a secure method of user 
authentication unless used in conjunction with some external secure 
system such as TLS (Transport Layer Security, [RFC5246]), as the 
user-id and password are passed over the network as cleartext. 


The "Basic" scheme previously was defined in Section 2 of [RFC2617]. 
This document updates the definition, and also addresses 
internationalization issues by introducing the ’charset’ 
authentication parameter (Section 2.1). 


Other documents updating RFC 2617 are "Hypertext Transfer Protocol 
(HTTP/1.1): Authentication" ([RFC7235], defining the authentication 
framework), "HTTP Digest Access Authentication" ([RFC7616], updating 
the definition of the "Digest" authentication scheme), and "HTTP 
Authentication-Info and Proxy-Authentication-Info Response Header 


Fields" ([RFC7615]). Taken together, these four documents obsolete 
RFC 2617. 
1.1. Terminology and Notation 


The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", 
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this 
document are to be interpreted as described in [RFC2119]. 


The terms "protection space" and "realm" are defined in Section 2.2 
of [RFC7235]. 


The terms "(character) repertoire" and "character encoding scheme" 
are defined in Section 2 of [RFC6365]. 


2. The ’Basic’ Authentication Scheme 


The Basic authentication scheme is based on the model that the client 
needs to authenticate itself with a user-id and a password for each 
protection space ("realm"). The realm value is a free-form string 
that can only be compared for equality with other realms on that 
server. The server will service the request only if it can validate 
the user-id and password for the protection space applying to the 
requested resource. 
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The Basic authentication scheme utilizes the Authentication Framework 
as follows. 


In challenges: 
o The scheme name is "Basic". 


o The authentication parameter ’realm’ is REQUIRED ([RFC7235], 
Section 2.2). 


o The authentication parameter ’charset’ is OPTIONAL (see 
Section 2.1). 


o No other authentication parameters are defined -- unknown 
parameters MUST be ignored by recipients, and new parameters can 
only be defined by revising this specification. 


See also Section 4.1 of [RFC7235], which discusses the complexity of 
parsing challenges properly. 


Note that both scheme and parameter names are matched case- 
insensitively. 


For credentials, the "token68" syntax defined in Section 2.1 of 
[RFC7235] is used. The value is computed based on user-id and 
password as defined below. 


Upon receipt of a request for a URI within the protection space that 
lacks credentials, the server can reply with a challenge using the 
401 (Unauthorized) status code ([RFC7235], Section 3.1) and the 
WWW-Authenticate header field ([RFC7235], Section 4.1). 


For instance: 
HTTP/1.1 401 Unauthorized 
Date: Mon, 04 Feb 2014 16:50:53 GMT 
WWW-Authenticate: Basic realm="WallyWorld" 


where "WallyWorld" is the string assigned by the server to identify 
the protection space. 


A proxy can respond with a similar challenge using the 407 (Proxy 


Authentication Required) status code ([RFC7235], Section 3.2) and the 
Proxy-Authenticate header field ([RFC7235], Section 4.3). 
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To receive authorization, the client 


1. obtains the user-id and password from the user, 

2. constructs the user-pass by concatenating the user-id, a single 
colon (":") character, and the password, 

3. encodes the user-pass into an octet sequence (see below for a 


discussion of character encoding schemes), 


4. and obtains the basic-credentials by encoding this octet sequence 
using Base64 ([RFC4648], Section 4) into a sequence of US-ASCII 
characters ([RFC0020]). 


The original definition of this authentication scheme failed to 
specify the character encoding scheme used to convert the user-pass 
into an octet sequence. In practice, most implementations chose 
either a locale-specific encoding such as ISO-8859-1 ([ISO-8859-1]), 
or UTF-8 ([RFC3629]). For backwards compatibility reasons, this 
specification continues to leave the default encoding undefined, as 
long as it is compatible with US-ASCII (mapping any US-ASCII 
character to a single octet matching the US-ASCII character code). 


The user-id and password MUST NOT contain any control characters (see 
"CTL" in Appendix B.1 of [RFC5234]). 


Furthermore, a user-id containing a colon character is invalid, as 
the first colon in a user-pass string separates user-id and password 
from one another; text after the first colon is part of the password. 
User-ids containing colons cannot be encoded in user-pass strings. 


Note that many user agents produce user-pass strings without checking 
that user-ids supplied by users do not contain colons; recipients 


will then treat part of the username input as part of the password. 


If the user agent wishes to send the user-id "Aladdin" and password 
"open sesame", it would use the following header field: 


Authorization: Basic QWxhZGRpbjpvcGVulIHN1lc2FtZQ== 
2.1. The ’charset’ auth-param 
In challenges, servers can use the ’charset’ authentication parameter 
to indicate the character encoding scheme they expect the user agent 


to use when generating "user-pass" (a sequence of octets). This 
information is purely advisory. 
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The only allowed value is "UTF-8"; it is to be matched case- 
insensitively (see [RFC2978], Section 2.3). It indicates that the 
server expects character data to be converted to Unicode 
Normalization Form C ("NFC"; see Section 3 of [RFC5198]) and to be 
encoded into octets using the UTF-8 character encoding scheme 
([RFC3629]). 


For the user-id, recipients MUST support all characters defined in 
the "UsernameCasePreserved" profile defined in Section 3.3 of 
[RFC7613], with the exception of the colon (":") character. 


For the password, recipients MUST support all characters defined in 
the "OpaqueString" profile defined in Section 4.2 of [RFC7613]. 


Other values are reserved for future use. 


Note: The ‘charset’ is only defined on challenges, as Basic 
authentication uses a single token for credentials (’token68’ 
syntax); thus, the credentials syntax isn’t extensible. 


Note: The name ’charset’ has been chosen for consistency with 
Section 2.1.1 of [RFC2831]. A better name would have been 
"accept-charset’, as it is not about the message it appears in, 
but the server’s expectation. 


In the example below, the server prompts for authentication in the 
"foo" realm, using Basic authentication, with a preference for the 
UTF-8 character encoding scheme: 

WwWW-Authenticate: Basic realm="foo", charset="UTF-8" 
Note that the parameter value can be either a token or a quoted 
string; in this case, the server chose to use the quoted-string 
notation. 
The user’s name is "test", and the password is the string "123" 
followed by the Unicode character U+00A3 (POUND SIGN). Using the 


character encoding scheme UTF-8, the user-pass becomes: 


hoe het et feet KAT, i ee DT ZN pound 
74 65 73 74 3A 31 32 33 C2 A3 


Encoding this octet sequence in Base64 ([RFC4648], Section 4) yields: 


dGVzdDoxMjPCow== 
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Thus, the Authorization header field would be: 
Authorization: Basic dGVzdDoxMjPCow== 
Or, for proxy authentication: 
Proxy-Authorization: Basic dGVzdDoxMjPCow== 
2.2. Reusing Credentials 


Given the absolute URI ([RFC3986], Section 4.3) of an authenticated 
request, the authentication scope of that request is obtained by 
removing all characters after the last slash ("/") character of the 
path component ("hier_part"; see [RFC3986], Section 3). A client 
SHOULD assume that resources identified by URIs with a prefix-match 
of the authentication scope are also within the protection space 
specified by the realm value of that authenticated request. 


A client MAY preemptively send the corresponding Authorization header 
field with requests for resources in that space without receipt of 
another challenge from the server. Similarly, when a client sends a 
request to a proxy, it MAY reuse a user-id and password in the Proxy- 
Authorization header field without receiving another challenge from 
the proxy server. 


For example, given an authenticated request to: 


http://example.com/docs/index.html 

requests to the URIs below could use the known credentials: 
http://example.com/docs/ 
http://example.com/docs/test.doc 
http://example.com/docs/?page=1 

while the URIs 


http://example.com/other/ 
https: //example.com/docs/ 


would be considered to be outside the authentication scope. 
Note that a URI can be part of multiple authentication scopes (such 
as "http://example.com/" and "http://example.com/docs/"). This 


specification does not define which of these should be treated with 
higher priority. 
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3. Internationalization Considerations 


User-ids or passwords containing characters outside the US-ASCII 
character repertoire will cause interoperability issues, unless both 
communication partners agree on what character encoding scheme is to 
be used. Servers can use the new ’charset’ parameter (Section 2.1) 
to indicate a preference of "UTF-8", increasing the probability that 
clients will switch to that encoding. 


The ’realm’ parameter carries data that can be considered textual; 
however, [RFC7235] does not define a way to reliably transport non- 
US-ASCII characters. This is a known issue that would need to be 
addressed in a revision to that specification. 


4. Security Considerations 


The Basic authentication scheme is not a secure method of user 
authentication, nor does it in any way protect the entity, which is 
transmitted in cleartext across the physical network used as the 
carrier. HTTP does not prevent the addition of enhancements (such as 
schemes to use one-time passwords) to Basic authentication. 


The most serious flaw of Basic authentication is that it results in 
the cleartext transmission of the user’s password over the physical 
network. Many other authentication schemes address this problem. 


Because Basic authentication involves the cleartext transmission of 
passwords, it SHOULD NOT be used (without enhancements such as HTTPS 
[RFC2818]) to protect sensitive or valuable information. 


A common use of Basic authentication is for identification purposes 
-- requiring the user to provide a user-id and password as a means of 
identification, for example, for purposes of gathering accurate usage 
statistics on a server. When used in this way it is tempting to 
think that there is no danger in its use if illicit access to the 
protected documents is not a major concern. This is only correct if 
the server issues both user-id and password to the users and, in 
particular, does not allow the user to choose his or her own 
password. The danger arises because naive users frequently reuse a 
single password to avoid the task of maintaining multiple passwords. 


If a server permits users to select their own passwords, then the 
threat is not only unauthorized access to documents on the server but 
also unauthorized access to any other resources on other systems that 
the user protects with the same password. Furthermore, in the 
server’s password database, many of the passwords may also be users’ 
passwords for other sites. The owner or administrator of such a 
system could therefore expose all users of the system to the risk of 
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unauthorized access to all those other sites if this information is 
not maintained in a secure fashion. This raises both security and 
privacy concerns ([RFC6973]). If the same user-id and password 
combination is in use to access other accounts, such as an email or 
health portal account, personal information could be exposed. 


Basic authentication is also vulnerable to spoofing by counterfeit 
servers. If a user can be led to believe that she is connecting to a 
host containing information protected by Basic authentication when, 
in fact, she is connecting to a hostile server or gateway, then the 
attacker can request a password, store it for later use, and feign an 
error. Server implementers ought to guard against this sort of 
counterfeiting; in particular, software components that can take over 
control over the message framing on an existing connection need to be 
used carefully or not at all (for instance: NPH ("Non-Parsed Header") 
scripts as described in Section 5 of [RFC3875]). 


Servers and proxies implementing Basic authentication need to store 
user passwords in some form in order to authenticate a request. 
These passwords ought to be stored in such a way that a leak of the 
password data doesn’t make them trivially recoverable. This is 
especially important when users are allowed to set their own 
passwords, since users are known to choose weak passwords and to 
reuse them across authentication realms. While a full discussion of 
good password hashing techniques is beyond the scope of this 
document, server operators ought to make an effort to minimize risks 
to their users in the event of a password data leak. For example, 
servers ought to avoid storing user passwords in plaintext or as 
unsalted digests. For more discussion about modern password hashing 
techniques, see the "Password Hashing Competition" 
(<https://password-hashing.net>). 


The use of the UTF-8 character encoding scheme and of normalization 
introduces additional security considerations; see Section 10 of 
[RFC3629] and Section 6 of [RFC5198] for more information. 

5. IANA Considerations 
IANA maintains the "Hypertext Transfer Protocol (HTTP) Authentication 
Scheme Registry" ([RFC7235]) at <http://www.iana.org/assignments/ 
http-authschemes>. 


The entry for the "Basic" authentication scheme has been updated to 
reference this specification. 
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Appendix A. Changes from RFC 2617 


The scheme definition has been rewritten to be consistent with newer 
specifications such as [RFC7235]. 


The new authentication parameter ’charset’ has been added. It is 
purely advisory, so existing implementations do not need to change, 
unless they want to take advantage of the additional information that 
previously wasn’t available. 


Appendix B. Deployment Considerations for the ’charset’ Parameter 
B.1. User Agents 


User agents not implementing ’charset’ will continue to work as 
before, ignoring the new parameter. 


User agents that already default to the UTF-8 encoding implement 
‘charset’ by definition. 


Other user agents can keep their default behavior and switch to UTF-8 
when seeing the new parameter. 


B.2. Servers 


Servers that do not support non-US-ASCII characters in credentials do 
not require any changes to support ’charset’. 


Servers that need to support non-US-ASCII characters, but cannot use 
the UTF-8 character encoding scheme will not be affected; they will 
continue to function as well or as badly as before. 


Finally, servers that need to support non-US-ASCII characters and can 
use the UTF-8 character encoding scheme can opt in by specifying the 
‘charset’ parameter in the authentication challenge. Clients that do 
understand the ’charset’ parameter will then start to use UTF-8, 
while other clients will continue to send credentials in their 
default encoding, broken credentials, or no credentials at all. 

Until all clients are upgraded to support UTF-8, servers are likely 
to see both UTF-8 and "legacy" encodings in requests. When 
processing as UTF-8 fails (due to a failure to decode as UTF-8 or a 
mismatch of user-id/password), a server might try a fallback to the 
previously supported legacy encoding in order to accommodate these 
legacy clients. Note that implicit retries need to be done 
carefully; for instance, some subsystems might detect repeated login 
failures and treat them as a potential credentials-guessing attack. 
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B.3. Why not simply switch the default encoding to UTF-8? 


There are sites in use today that default to a local character 
encoding scheme, such as ISO-8859-1 ([ISO-8859-1]), and expect user 
agents to use that encoding. Authentication on these sites will stop 
working if the user agent switches to a different encoding, such as 
UTF-8. 


Note that sites might even inspect the User-Agent header field 
(IRFC7231], Section 5.5.3) to decide which character encoding scheme 
to expect from the client. Therefore, they might support UTF-8 for 
some user agents, but default to something else for others. User 
agents in the latter group will have to continue to do what they do 
today until the majority of these servers have been upgraded to 
always use UTF-8. 
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